A branch prediction control circuit has a register.
A main memory provides a machine language
instruction to the register and an instruction buffer
(101). The prediction control circuit decodes the
instruction held in the register and predicts whether
or not the branch is taken. If the instruction is a
relative branch instruction, a sum of the address of
the instruction and a displacement included in the
instruction is computed to provide a branch address.
At the same time, a branch address valid signal is
provided. With this arrangement, reliable branch
prediction of the relative branch instruction is realized
with a small amount of hardware.

BRANCH CONTROL CIRCUIT



BACKGROUND OF THE INVENTION 



1. Field of the Invention

The present invention relates to a branch control
circuit used for pipeline processing of instructions,
and particularly to a branch control circuit
that can minimize confusion caused by branching
in the pipeline processing.

These days, many data processors employ
pipeline processing to process and execute
instructions at a high speed.

The pipeline processing involves, for example, 
an instruction fetching stage, an instruction decod- 
ing stage, an effective address generating stage, 
an operand fetching stage, and an instruction ex- 
ecuting stage. Instructions are successively read 
from a memory and sequentially processed 
through these pipeline stages. Each of the stages 
processes a different instruction, and if there is no 
confusion in a flow in the pipeline stages, the 
system provides maximum performance. 

Suppose each processing stage consumes one
machine cycle. A data processor without the
pipeline system then requires five machine cycles
to process one instruction. A data processor
employing the pipeline processing, on the other
hand, can process instructions five times as fast
as the non-pipeline processor, because it completes
one instruction in each machine cycle if no
confusion occurs in the flow of the pipeline
processing.
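
The cycle counts above can be checked with a small sketch. It assumes an ideal five-stage pipeline with one machine cycle per stage and no branches or stalls; the function names are illustrative and do not come from the disclosure.

```python
STAGES = 5  # fetch, decode, address generation, operand fetch, execute


def cycles_without_pipeline(n_instructions):
    # Each instruction passes through all stages before the next one starts.
    return STAGES * n_instructions


def cycles_with_pipeline(n_instructions):
    # The first instruction needs STAGES cycles to fill the pipeline;
    # after that, one instruction completes in every cycle.
    if n_instructions == 0:
        return 0
    return STAGES + (n_instructions - 1)


def speedup(n_instructions):
    # Ratio of the two cycle counts for the same instruction stream.
    return cycles_without_pipeline(n_instructions) / cycles_with_pipeline(n_instructions)
```

For a long undisturbed instruction stream the ratio approaches five, which is the figure quoted in the text.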

According to a conventional pipeline technique, 
however, a branch instruction may cause confusion 
in the pipeline processing. After executing the 
branch instruction, the processor must invalidate 
instructions that are following the branch instruction 
and being processed in the pipeline stages, and 
restart processing from a target instruction of the
branch. This deteriorates performance of the pipe- 
line. 

To deal with the problem, a processor employing
a branch prediction method has been proposed.
This method predicts whether or not a
branch instruction is taken, according to branch
prediction information. If the branch instruction is
predicted to be taken, the processor starts to
process an instruction addressed by the branch
instruction before executing the branch instruction,
thereby reducing confusion in the pipeline.

According to this technique, some means must
be provided to find a target address of the branch
instruction. One means to find the address is a
look-up table which is referred to in the instruction
fetching stage of the pipeline.

Figure 1 shows flows of instructions in pipeline 
stages of a conventional branch control circuit em- 
ploying the look-up table. 

The look-up table contains addresses of branch
instructions BIs and addresses of target instructions
TIs. The look-up table is referred to in the
instruction fetching stage or instruction decoding
stage of the pipeline to find a target address of a
branch instruction BI. If the given branch instruction
BI is predicted to be taken, an instruction
located at a target address obtainable from the
look-up table is fetched. This technique may
minimize confusion in the pipeline, as shown
in Fig. 1, if the branch prediction hits.

The look-up table and its contents, however,
inevitably increase the size of hardware. In other
words, the number of entries of the look-up table is
limited by hardware.

If the number of entries of the look-up table is
small, a mis-hit rate will rise in referring to the
look-up table. This makes the prediction of a branch
instruction BI impossible.

In Fig. 2, flows of a pipeline process with
unsuccessful branch prediction are shown. As
shown in Fig. 2, since an instruction fetch of the
target instructions TIs is carried out after executing
the branch instruction BI, the process after the first
time requires four more cycles than that at the first
time.

In this way, the conventional branching control
technique for a pipeline system requires the size of
hardware (the size of the look-up table) to be
sufficiently large to increase a hit rate of branch
prediction. If the number of entries of the look-up
table is small, a mis-hit rate increases to cause
confusion in the pipeline.



SUMMARY OF THE INVENTION 

To solve these problems, an object of the
present invention is to provide a branch control
circuit that can minimize confusion of a pipeline
caused by a branch instruction, without
increasing hardware.

In order to accomplish the object, the present
invention provides a branch prediction control
circuit comprising a first register for holding at least
one machine language instruction fetched from a
memory; a second register for holding an address
of the machine language instruction; a first
detection circuit connected to the first register to detect
that the first register contains a relative branch
instruction that is predicted to be taken, and
provide a branch address valid signal; and an adder
for adding a displacement included in the relative
branch instruction to the address held in the
second register and providing a branch target address.

According to a branch control circuit of the
present invention, a sub-decoding circuit is
arranged outside a main decoding circuit. The
sub-decoding circuit handles only a short format
relative branch instruction that frequently appears, to
predict whether or not the branch instruction is
taken. A branch address is found by adding an
address of the relative branch instruction to a
displacement contained in the branch instruction. This
branch control circuit of the present invention can
be formed with small hardware. Since the present
invention does not require branch addresses to be
stored, a memory for storing the branch addresses
can be omitted, and confusion caused by
successful branching in a pipeline can be minimized.

These and other objects, features and advan- 
tages of the present invention will be more appar- 
ent from the following detailed description of pre- 
ferred embodiments in conjunction with the accom- 
panying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a view showing flows of a pipeline
process with successful branch prediction,
according to a conventional branch control
technique;

Fig. 2 is a view showing flows of a pipeline 
process with unsuccessful branch prediction, ac- 
cording to the conventional branch control tech- 
nique; 

Fig. 3 is a block diagram showing a basic ar- 
rangement of a branch control circuit according 
to the present invention; 

Fig. 4 is a view showing a format of instruction 
used by the branch control circuit of the present 
invention; 

Fig. 5 is a view showing various formats of 
relative branch instruction used by the branch 
control circuit of the present invention; 
Fig. 6 is a detailed circuit diagram showing a 
sub-decoding circuit of Fig. 3; 
Fig. 7 is a table showing relations between
branch prediction information, bit statuses of a
fetching PC, and selector outputs of Fig. 6;
Fig. 8 is a view showing flows of a pipeline
process and explaining a branch predicting
operation of the sub-decoding circuit;
Fig. 9 is a detailed circuit diagram showing a main
decoding circuit of Fig. 3;

Fig. 10 is a table showing relations of bit
statuses of a field SS and selector outputs; and
Fig. 11 is a view showing flows of a pipeline
process and explaining a branch predicting
operation of the main decoding circuit.

DETAILED DESCRIPTION OF THE EMBODIMENTS

Figure 3 shows a branch control circuit accord- 
ing to an embodiment of the present invention. 

The branch control circuit comprises an
instruction buffer 101 for holding instructions that are
provided in an instruction fetching stage of a
pipeline process, a sub-decoding circuit 102 for
decoding only a short format relative branch instruction,
and a main decoding circuit 103 for decoding a
normal-type relative branch instruction.

In operation, the sub-decoding circuit 102
decodes only a short format relative branch
instruction, for branch prediction. The main decoding
circuit 103 receives the first one of the instructions
stored in the instruction buffer 101 through a signal
line 106, and decodes it into an internal instruction,
which is provided to a signal line 109 for later
processes. Also, the main decoding circuit 103
predicts whether or not a general format relative
branch instruction is taken, and provides a branch
target address to a signal line 111 and a branch
address valid signal to a signal line 110.

Figure 4 shows an instruction format employed
by the present invention. The basic unit of this format
is one half word (HW), and an instruction has a
variable length. An instruction of one half word
comprises a field BI containing branch prediction
information, and a field INST containing an
instruction function and operand information. In this
embodiment the field BI comprises one bit, which
will be "1" when the field INST following the field BI
is a head of a branch instruction and when the
branch instruction is predicted to be taken, and will
be "0" in other cases. The field INST comprises
sixteen bits.
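
A minimal sketch of this half-word layout follows. It assumes that the BI bit is stored in front of the sixteen INST bits as one 17-bit value; the storage layout and the helper names are illustrative assumptions and are not taken from the disclosure.

```python
INST_BITS = 16  # width of the INST field


def pack_half_word(bi, inst):
    """Combine a 1-bit BI field and a 16-bit INST field into one 17-bit value."""
    if bi not in (0, 1):
        raise ValueError("BI is a single bit")
    if not 0 <= inst < (1 << INST_BITS):
        raise ValueError("INST must fit in sixteen bits")
    return (bi << INST_BITS) | inst


def unpack_half_word(word):
    """Split a 17-bit value back into the pair (BI, INST)."""
    bi = (word >> INST_BITS) & 1
    inst = word & ((1 << INST_BITS) - 1)
    return bi, inst
```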

An instruction fetch circuit provides instructions
of aligned four half words to the instruction buffer
101 in one cycle. The instruction buffer 101 is
capable of storing, for example, eight half words of
instructions, and provides the first three half words of
instructions to the main decoding circuit 103
through the line 106.

Figure 5 shows formats of short format and
general format relative branch instructions
employed by the prediction control circuit of the
present invention. A sum of the address of a
branch instruction and a displacement contained in
the branch instruction is used as a branch target
address.

The short format relative branch instruction is
one half word in length. Bits 0 to 7 of field INST of
this instruction represent an operation code (OP
code), and bits 8 to 15 represent a displacement
(#d8).

Twice the displacement #d8 defines the
difference between the address of the branch
instruction and the branch target address.

The general format relative branch instruction
may include a displacement of eight, sixteen, or
thirty-two bits. If the displacement size is eight bits,
a field SS of the instruction will be "00". If the
displacement size is sixteen bits, the field SS of
the instruction will be "01". In this case, the
instruction comprises two half words, in which the
first one indicates a function of the instruction, and
the second one indicates a displacement. If the
displacement size is thirty-two bits, the field SS of
the instruction will be "10". In this case, the
instruction comprises three half words, in which the
first one indicates a function of the instruction, the
second one indicates upper sixteen bits of a
displacement, and the third one indicates lower
sixteen bits of the displacement.
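
The relation between the field SS and the instruction size can be tabulated as in the following sketch. The one-half-word length for the eight-bit case is an assumption, since the text states the length only for the sixteen-bit and thirty-two-bit cases; the names are illustrative.

```python
# SS value -> (displacement size in bits, instruction length in half words)
SS_FORMATS = {
    0b00: (8, 1),   # assumed: the 8-bit displacement fits in the first half word
    0b01: (16, 2),  # function half word + one displacement half word
    0b10: (32, 3),  # function half word + upper and lower displacement half words
}


def general_format(ss):
    """Look up the displacement size and instruction length for a given SS value."""
    if ss not in SS_FORMATS:
        raise ValueError("SS value not described in the text")
    return SS_FORMATS[ss]
```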

The sub-decoding circuit 102 handles only the 
short format relative branch instruction, and pre- 
dicts whether or not the branch is taken. 

Figure 6 is a detailed circuit diagram showing 
the sub-decoding circuit 102 of Fig. 3. 

The sub-decoding circuit 102 comprises an 
instruction latch 401, a fetching PC 402, a selector
403, a branch instruction decoding circuit 404, and 
an adder 405. 

In operation, instructions of four half words are 
read from the instruction fetch circuit through the 
line 104, and stored in the instruction latch 401. A 
first address of the read instructions is stored in the 
fetching PC 402 through a line 105. Since the
minimum instruction unit is a half word, bit 31 of the
fetching PC 402 is always "0".

Instructions are aligned in four half words. An
instruction with bits 29 and 30 of its address being
"00" is stored in a field INST[0-15] of the
instruction latch 401, an instruction with bits 29 and 30
being "01" in a field INST[16-31], an instruction
with bits 29 and 30 being "10" in a field
INST[32-47], and an instruction with bits 29 and 30 being
"11" in a field INST[48-63]. When instructions are
successively fetched, the bits 29 and 30 of the
fetching PC 402 are "00" and all four half words of
the instruction latch 401 are valid.

When a branch target instruction is fetched, the
bits 29 and 30 of the fetching PC 402 indicate a
branch target address, and in the instruction latch
401, only the fields after the branch address are
valid. For example, if the bits 29 and 30 of the
fetching PC 402 are "10", the fields INST[32-47]
and INST[48-63] and fields BI[2] and BI[3] are
valid.
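
The validity rule described above can be sketched as follows. The sketch assumes a 32-bit fetching PC in which bit 0 is the most significant bit and bit 31 the least significant, so that bits 29 and 30 are obtained by shifting right by one; the function names are illustrative.

```python
def slot_index(fetch_pc):
    """Bits 29 and 30 of the fetching PC, with bit 31 as the least significant bit."""
    return (fetch_pc >> 1) & 0b11


def valid_slots(fetch_pc):
    """Validity of INST[0-15], INST[16-31], INST[32-47], and INST[48-63]."""
    first = slot_index(fetch_pc)
    # Fields before the fetched address are invalid; the rest are valid.
    return [slot >= first for slot in range(4)]
```

With bits 29 and 30 equal to "10", only the last two fields are reported valid, matching the example in the text.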
The selector 403 selects one of the four half
words stored in the instruction latch 401 according
to bits of the fields BI[0] to BI[3] of the instruction
latch 401 and the bits 29 and 30 of the fetching PC
402. The selector 403 then provides the selected
half word to the branch instruction decoding circuit
404 and adder 405.

Figure 7 is a table showing statuses of the bits
29 and 30 of the fetching PC 402 and statuses of
the branch prediction information bits BI[0] to BI[3]
of the instruction latch 401, and outputs of the
selector 403 corresponding to these statuses.
Supposing the bits 29 and 30 of the fetching PC 402
are "10" with BI[0] = 0, BI[1] = 0 and BI[2] = 1, the
selector 403 selects the field INST[32-47] of the
instruction latch 401.

If the instruction latch 401 is storing a branch
instruction which is predicted to be taken, the
selector 403 selects the branch instruction of half
word and provides the same to the branch
instruction decoding circuit 404 and adder 405. If the
instruction latch 401 is storing no branch instruction
which is predicted to be taken, i.e., if all bits of the
fields BI[0] to BI[3] are each 0, the selector 403
provides all "1".
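
The behaviour of the selector 403 can be sketched as follows. The table of Fig. 7 is not reproduced in the text, so choosing the first valid field whose BI bit is "1" is an assumption, as are the function and constant names.

```python
ALL_ONES = 0xFFFF  # output when no predicted-taken branch is held in the latch


def select_half_word(inst_fields, bi_bits, pc_bits_29_30):
    """Return the first valid half word whose BI bit is 1, else all ones.

    inst_fields: four 16-bit values, INST[0-15] to INST[48-63]
    bi_bits: four bits, BI[0] to BI[3]
    pc_bits_29_30: value of bits 29 and 30 of the fetching PC (0 to 3)
    """
    # Fields before the fetched address are invalid and are skipped.
    for slot in range(pc_bits_29_30, 4):
        if bi_bits[slot] == 1:
            return inst_fields[slot]
    return ALL_ONES
```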

The branch instruction decoding circuit 404
receives an upper byte of the output of the selector
403. If the upper byte coincides with one of the
operation code fields OP of short format relative branch
instructions, the circuit 404 asserts a branch
address valid signal 107. The bit pattern of all "1"
does not coincide with any operation code field
OP of the short format relative branch instructions.
In this case, the circuit 404 does not assert the
branch address valid signal 107.

The adder 405 adds a value obtainable by
shifting a lower byte of the output of the selector
403 by one bit to the left to a value of the fetching
PC 402, and provides a result of the addition to a
signal line 108.
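
The decoding circuit 404 and the adder 405 can be sketched together as follows. The operation code values are hypothetical placeholders, since the text gives no encodings. Treating #d8 as a two's-complement value is also an assumption; it can be switched off with the signed_disp argument to follow the text literally.

```python
SHORT_BRANCH_OPCODES = {0x20, 0x21}  # hypothetical opcode values, for illustration only
MASK32 = 0xFFFFFFFF


def sub_decode(selected, fetch_pc, signed_disp=True):
    """Return (branch_address_valid, branch_address) for one selected half word."""
    opcode = (selected >> 8) & 0xFF   # upper byte: operation code
    d8 = selected & 0xFF              # lower byte: displacement #d8
    if signed_disp and d8 >= 0x80:
        d8 -= 0x100                   # assumed two's-complement displacement
    valid = opcode in SHORT_BRANCH_OPCODES
    # Shift the displacement left by one bit and add it to the fetching PC.
    address = (fetch_pc + (d8 << 1)) & MASK32
    return valid, address
```

The all-ones pattern from the selector has an upper byte of 0xFF, which is not in the hypothetical opcode set, so the valid signal stays deasserted in that case.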

Figure 8 shows flows of a pipeline process with 
the sub-decoding circuit 102 carrying out branch 
prediction according to the present invention. 

In the figure, the sub-decoding circuit 102 and
instruction buffer 101 of Fig. 3 simultaneously
receive a new instruction, and the sub-decoding
circuit 102 decodes the instruction in this cycle. Since
the main decoding circuit 103 decodes the first one of
the instructions stored in the instruction buffer 101, the
sub-decoding circuit 102 and main decoding circuit
103 simultaneously decode different instructions,
respectively, in cycle 1 of Fig. 8. The decoding in
the sub-decoding circuit 102 is carried out only to
confirm a short format relative branch instruction.

Usually, before an instruction processed by the
sub-decoding circuit 102 starts to be processed by
the main decoding circuit 103, preceding
instructions stored in the instruction buffer 101 are
processed in the main decoding circuit 103. In Figure
8, a preceding instruction I1 is decoded in the main
decoding circuit 103 in cycle 1. An important thing
is that a branch instruction BI is followed, just after
it, by a target instruction TI. In this case, if branch
prediction is correct, the pipeline process will not
be disturbed by the branching.

Figure 9 is a detailed circuit diagram showing 
the main decoding circuit 103 of Fig. 3. 

The main decoding circuit 103 comprises an
instruction latch 701 that is a shift register, a
decoding PC 702, a selector 703, a decoding circuit
704, a first adder 705, and a second adder 706.
Instructions of three half words are given from the
instruction buffer 101 to the main decoding circuit
103 and held in the instruction latch 701.

Among the instructions of three half words, a
first half word including a field BI[0] and a field
INST[0-15] is provided to the decoding circuit 704.
The decoding circuit 704 converts the instruction
into an internal instruction, which is supplied to an
effective address computing portion through the
signal line 109. The length of the decoded
instruction is provided to the second adder 706.

The second adder 706 adds the contents of the 
decoding PC 702, i.e., an address of the first half 
word of the instruction latch 701 to the decoded 
instruction length provided by the decoding circuit 
704. When the next instruction is supplied from the 
instruction buffer 101 to the main decoding circuit 
103, an output of the second adder 706 is stored in 
the decoding PC 702. 

If a branch is taken, an instruction fetch ad- 
dress is stored in the decoding PC 702 from an 
instruction fetching portion through the line 105. 
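
The update of the decoding PC 702 can be sketched as follows. The sketch assumes byte addressing with two bytes per half word and an instruction length expressed in half words; these units, and the names used, are assumptions not stated in the text.

```python
HALF_WORD_BYTES = 2  # assumed: byte addressing, two bytes per half word


def next_decoding_pc(decoding_pc, length_half_words, branch_fetch_address=None):
    """Value loaded into the decoding PC when the next instruction arrives."""
    if branch_fetch_address is not None:
        # A branch was taken: load the instruction fetch address instead.
        return branch_fetch_address
    # Otherwise use the second adder: current address plus instruction length.
    return (decoding_pc + length_half_words * HALF_WORD_BYTES) & 0xFFFFFFFF
```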

When the field BI[0] is "1" and the field
INST[0-15] coincides with a bit pattern of a general
format relative branch instruction of two or three
half words in length, the decoding circuit 704
provides a branch address valid signal 110. At this
moment, a branch address has been computed 
already by the selector 703 and first adder 705. 
The selector 703 selects a displacement included 
in the general format relative branch instruction. 
The size of the displacement is indicated by a 
section SS of bits 6 and 7 of the field INST[0-15] of 
the instruction. 

Figure 10 is a table showing relations between 
bit statuses of the field SS of the first half word and 
selected outputs of the selector 703. 

In Fig. 9, the first adder 705 adds a value of 
the decoding PC 702 to a displacement (bit values 
of corresponding one of the INST fields of Fig. 10) 
of the relative branch instruction that is a selected
output of the selector 703, thereby finding a branch 
target address, which is supplied to a signal line 
111. 
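
The selector 703 and the first adder 705 can be sketched as follows. The sketch assumes that bit 0 of INST[0-15] is the most significant bit, that the eight-bit displacement occupies the low byte of the first half word, and that displacements are two's-complement values; Fig. 10 is not reproduced in the text, so these points and all names are assumptions.

```python
def select_displacement(half_words, ss):
    """half_words: the three 16-bit INST fields held in the instruction latch."""
    first, second, third = half_words
    if ss == 0b00:
        value, bits = first & 0xFF, 8   # assumed position: low byte of the first half word
    elif ss == 0b01:
        value, bits = second, 16
    elif ss == 0b10:
        value, bits = (second << 16) | third, 32
    else:
        raise ValueError("SS value not described in the text")
    # Assumed two's-complement sign extension.
    if value >= 1 << (bits - 1):
        value -= 1 << bits
    return value


def branch_target(decoding_pc, half_words):
    """Add the selected displacement to the decoding PC (first adder)."""
    ss = (half_words[0] >> 8) & 0b11  # bits 6 and 7 of INST[0-15], bit 0 being the MSB
    return (decoding_pc + select_displacement(half_words, ss)) & 0xFFFFFFFF
```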

Figure 11 is a view showing pipeline flows with
the main decoding circuit 103 of Fig. 9 carrying out
branch prediction. In cycle 1, the main decoding
circuit 103 decodes a branch instruction BI that has
been predicted to be taken. In cycle 2, an
instruction TI specified by the branch instruction is
fetched. In cycle 3, the fetched instruction TI is
decoded by the main decoding circuit 103.
Accordingly, only one cycle of the pipeline process may
be disturbed between the branch instruction BI and
the instruction TI.

As explained above, according to the
embodiment of the present invention, the branch
prediction information field BI of one bit is provided for
each half word, and it will be "1" if the branch
is predicted to be taken, and "0" if it is predicted
not to be taken. In this way, prediction is made for
each branch instruction.

The field BI may comprise more than one bit to
hold a history of directions of several past
branches to carry out branch prediction based on
the history.

The main decoding circuit 103 may be
modified to carry out branch prediction on an
unconditional branch instruction, which is one of short
format relative branch instructions having a field BI
of "0" and on which the sub-decoding circuit 102
may not carry out the branch prediction.

Various modifications will become possible for
those skilled in the art after receiving the teachings
of the present disclosure without departing from the
scope thereof.

Claims 


1. A branch prediction control circuit comprising:
a first register for holding at least one machine
language instruction fetched from a memory;
a second register for holding an address of the
machine language instruction;
a first detection circuit connected to said first
register, for detecting that the first register contains a
relative branch instruction that is predicted to be
taken, and providing a branch address valid signal;
and
an adder for adding a displacement included in the
relative branch instruction to the address held in
said second register, and providing a branch
address.

2. The branch prediction control circuit as claimed
in claim 1, wherein said first register holds a
plurality of machine language instructions, and said
detection circuit carries out branch prediction by
referring to a branch information bit provided to each
of the machine language instructions, the branch
information bit indicating whether or not the
corresponding branch instruction is taken.
3. The branch prediction control circuit as claimed
in claim 2, wherein said detection circuit
comprises:

a selector for selecting, based on the branch
information bits, one of the machine language
instructions that is predicted to branch; and
a branch instruction decoding circuit for decoding
an operation code of the instruction selected by the
selector, and if the instruction is a short format
relative branch instruction, providing a branch
address valid signal.

4. The branch prediction control circuit as claimed
in claim 1 further comprising:

an instruction buffer for holding a machine
language instruction provided by the main memory at
the time when the same instruction is provided to
said first register; and

a main decoding circuit for decoding the first one of
the machine language instructions held in the
instruction buffer, the main decoding circuit comprising:
a third register for holding the machine language
instruction provided by the instruction buffer;
a fourth register for holding an address of the
machine language instruction;
a second detection circuit connected to the third
register, for decoding the machine language
instruction held in the third register to detect that the
machine language instruction is a relative branch
instruction, and providing a branch address valid
signal; and

an adder for adding a displacement included in the
relative branch instruction to the address held in
the fourth register, and providing a branch address.

5. The branch prediction control circuit as claimed
in claim 4, wherein the second detection circuit
refers to a branch information bit which is added to
the machine language instruction to indicate
whether or not the branch is taken, provides a signal
indicating that the branch address provided by the
adder is valid, and decodes an operation code of
the machine language instruction into an internal
instruction.